Jieyu Zhang

Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems

Apr 30, 2025
Via arXiv

Nemotron-Research-Tool-N1: Tool-Using Language Models with Reinforced Reasoning

Apr 25, 2025
Via arXiv

Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base

Mar 30, 2025
Via arXiv

Generate Any Scene: Evaluating and Improving Text-to-Vision Generation with Scene Graph Programming

Dec 11, 2024
Via arXiv

Template Matters: Understanding the Role of Instruction Templates in Multimodal Language Model Evaluation and Training

Dec 11, 2024
Via arXiv

TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action

Dec 10, 2024
Via arXiv

ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models

Dec 09, 2024
Via arXiv

EcoAct: Economic Agent Determines When to Register What Action

Nov 03, 2024
Via arXiv

Language Model Preference Evaluation with Multiple Weak Evaluators

Oct 14, 2024
Via arXiv

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Aug 16, 2024
Via arXiv